Elimination Forest Guided D Sparse LU Factorization
نویسندگان
چکیده
Sparse LU factorization with partial pivoting is important for many scienti c applications and delivering high perfor mance for this problem is di cult on distributed memory machines Our previous work has developed an approach called S that incorporates static symbolic factorization supernode partitioning and graph scheduling This paper studies the properties of elimination forests and uses them to guide supernode partitioning amalgamation and execu tion scheduling The new design with D mapping e ec tively identi es dense structures without introducing too many zeros in the BLAS computation and exploits asyn chronous parallelism with low bu er space cost The imple mentation of this code called S uses supernodal matrix multiplication which retains the BLAS level e ciency and avoids unnecessary arithmetic operations The experiments show that S improves our previous code substantially and can achieve up to GFLOPS on Cray T E MHz nodes which is the highest performance reported in the lit erature
منابع مشابه
Using Postordering and Static Symbolic Factorization for Parallel Sparse LU
In this paper we present several improvements of widely used parallel LU factorization methods on sparse matrices. First we introduce the LU elimination forest and then we characterize the L, U factors in terms of their corresponding LU elimination forest. This characterization can be used as a compact storage scheme of the matrix as well as of the task dependence graph. To improve the use of B...
متن کاملEfficient Sparse LU Factorization with Lazy Space Allocation
Static symbolic factorization coupled with D supernode partitioning and asyn chronous computation scheduling is a viable approach for sparse LU with dynamic partial pivoting Our previous implementation called S uses those techniques and achieves high giga op rates on distributed memory machines This paper studies the space requirement of this approach and proposes an optimization strategy calle...
متن کاملEecient Sparse Lu Factorization with Partial Pivoting on Distributed Memory Architectures
A sparse LU factorization based on Gaussian elimination with partial pivoting (GEPP) is important to many scientiic applications, but it is still an open problem to develop a high performance GEPP code on distributed memory machines. The main diiculty is that partial pivoting operations dynamically change computation and nonzero ll-in structures during the elimination process. This paper presen...
متن کاملParallel Sparse LU Factorization with Partial Pivoting on Distributed Memory Architectures
Gaussian elimination based sparse LU factorization with partial pivoting is important to many scientiic applications, but it is still an open problem to develop a high performance sparse LU code on distributed memory machines. The main diiculty is that partial pivoting operations make structures of L and U factors unpredictable beforehand. This paper presents an approach called S for paralleliz...
متن کاملS+: Efficient 2D Sparse LU Factorization on Parallel Machines
Static symbolic factorization coupled with supernode partitioning and asynchronous computation scheduling can achieve high giga op rates for parallel sparse LU factorization with partial pivoting This paper studies properties of elimination forests and uses them to optimize supernode partitioning amalgamation and execution scheduling It also proposes supernodal matrix multiplication to speed up...
متن کامل